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ABSTRACT 

In searches for planetary transits in the field, well over half of the survey stars 
are typically giants or other stars that are too large to permit straightforward 
detection of planets. For all-sky searches of bright < 11 stars, the fraction is 
~ 90%. We show that the great majority of these contaminants can be removed 
from the sample by analyzing their reduced proper motions (RPMs): giants have 
much lower RPMs than dwarfs of the same color. We use Hipparcos data to design 
a RPM selection function that eliminates most evolved stars, while rejecting only 
9% of viable transit targets. Our method can be applied using existing or soon- 
to-be-released all-sky data to stars V < 12.5 in the northern hemisphere and 

< 12 in the south. The method degrades at fainter magnitudes, but does 
so gracefully. For example, at = 14 it can still be used to eliminate giants 
redward of — / ~ 0.95, that is, the blue edge of the red giant clump. 

Subject headings: astrometry - planets - methods: statistical 



1. Introduction 



Spurred on by the detection of the transiting planetary companion to HD209458 (Char- 
bonneau et al. 2000) as well as the exciting scientific results that can be extracted from 
intensive foUowup of this object (Charbonneau et al. 2002; Cody & Sasselov 2002; Hui & 
Seager 2002), a large number of surveys are underway to detect planetary transits of stars 
both in clusters (Street et al. 2000) and the field (Howell et al. 2000; Brown & Charbonneau 
1999; Mallen-Ornelas 2001; Udalski et al. 2002; Street et al. 2002). The expected signature 
of a planetary transit, a ~ 1% drop in the stellar fiux over a duration of several hours, can be 
mimicked by a variety of non-planetary phenomena. These include transits by brown dwarfs 
or late M dwarfs, which have sizes similar to Jovian planets, grazing eclipses by ordinary 
stars, full eclipses by binaries that are ~ 100 times fainter than the target star but lie in the 
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same spatial resolution element, and transits of evolved stars by main-sequence stars. While 
it is sometimes possible to distinguish grazing eclipses from the flatter-bottomed transits 
using lightcurves of sufficient photometric precision, rejection of stellar eclipses and transits 
usually requires several spectra: an M star companion induces radial velocity (RV) variations 
of ~ lOkms"^, whereas planetary perturbations are 10 to 100 times smaller. 

Since obtaining these spectra is expensive in both telescope time and hiiman effort, 
it is important to seek other robust methods of recognizing non-planetary transits. Here 
we show how the great majority of evolved stars can be eliminated from the target list of 
transit searches using reduced proper motion (RPM) diagrams. Field star transit surveys 
usually contain one to several times more evolved stars than main-sequence stars, while only 
the latter present useful targets. Hence, robust rejection of evolved-star contamination of 
transit surveys should result in substantial gains in efficiency in both the data analysis and 
the follow-up observations required for confirmation. 

In § 2, we review the basic physical principles that allow one to recognize evolved stars 
from a RPM diagram. In § 3, we assemble an an all-sky catalog of transit targets < 11. 
We show that our RPM criteria reject about 60% of the stars in this magnitude range that 
remain after an initial 25% are already rejected because their colors are too blue. That is, 
altogether 70% are rejected. In § 4, we establish the criteria for rejecting evolved stars using 
a combination of Tycho-2 (H0g et al. 2000) and 2MASS (Skrutskic ct al. 1997) data. These 
criteria can be applied to stars y < 12 over the whole sky once the full 2MASS catalog is 
released. In § 5, we show how the same technique can be extended to V" ~ 12.5 over the 
northern sky by combining 2MASS and UCAC (Zacharias et al. 2000) data, supplemented 
with data from the transit surveys themselves. We also show that the method degrades 
gracefully at fainter magnitudes, and can still be used to eliminate the majority of evolved 
stars at V — 14. While most of the paper focuses on the problem of selecting targets for 
transits of amplitude ~ 1%, we consider the possibility in § 6 that smaller amplitudes might 
be reached for brighter subsamples of the main survey. In this way, efficient transit searches 
might be extendable to stars of somewhat larger radius. Finally, in § 7, we briefly discuss 
our results. 



2. Principles 

Most ground-based transit surveys aim to detect transits with depths of about 1%, 
roughly the fraction of solar light blocked by Jupiter. Since planets, or even brown dwarfs, 
are not expected to get much larger than Jupiter, this level of sensitivity limits the parent-star 
population that can be probed with transits to those with radii r < r©. Early main-sequence 
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(MS) stars and stars that have evolved significantly off the MS are not useful: 1% transits 
would then be due to other stars, not planets. 

Broadly speaking, the contaminants can be divided into three spectral groups: early- 
type, mid- type (roughly "G-type"), and late-type. It is straightforward to eliminate early- 
type stars using a simple color cut, and in fact most survey teams already do this. Late-type 

stars come in two distinct classes, giants and dwarfs, which are separated by several magni- 
tudes in absolute magntitude. In general, the parallaxes (and hence absolute magnitudes) 
of these stars are unknown, but if their proper motions are measured with sufficient accu- 
racy, these classes can nevertheless be distinguished using a reduced proper motion (RPM) 
diagram. The V band RPM, Vrpm, is related to My by 

yRPM = y + 51ogAt = My + 51og (1) 

47.4 km s 

where /i is the proper motion in arcsecyr^^ and v± is the transverse velocity. Hence, a 
RPM diagram looks similar to a standard color-magnitude diagram (CMD) but with greater 
scatter owing to the dispersion in v±. However, for colors at which the giants and dwarfs 
are separated by > 3 mag, this scatter is not sufficient to confuse the two populations. 

Contamination from evolved G stars poses the most difficult problem. The MS and 
evolved populations are typically separated by just a couple of magnitudes. This is enough 
to make the evolved stars too large to permit transit detections, but not large enough to easily 
distinguish them from dwarfs of the same color using a RPM diagram. Some contamination 
from evolved G stars is therefore inevitable. 

Figure 1 is a GMD of Hipparcos stars within the completeness limit V < 7.3 and further 
restricted to parallax errors below 5% and photometry errors below 0.1 mag in each band. 
The solid line shows the location of stars with estimated radii r = 1.25 r©, for which we use 
the formula, 

r Mv 
log — = 0.597 + 0.536(St - Ft) —■ (2) 

r@ 5 

Here, Bt and Vt are the Hipparcos passbands. To find the slope in this equation, we 
combine the van Belle (1999) scahng, logr oc 0.064(y — K) — 0.2 Mk, with the standard 
Hipparcos (ESA 1997) transformation {V — Vt) — —0.09{Bt — Vt), and a transformation 
(Vt — K) — 2.0525(Sr — Vt) + 0.1588 empirically determined by matching Hipparcos and 
2MASS (Skrutskie et al. 1997) data over the relevant range 0.4 < {Bt - Vt) < 0.8. We 
normalize the zero point to the radius of the Sun, assuming it has {Bt — Vt)q = 0.71 
and My^, = 4.89, which we obtain from the ESA (1997) transformations starting with 
{B - V)q - 0.65, Mv,Q = 4.83. 

In Figure 2, we plot the RPMs of all the stars from Figure 1, the upper and lower 
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panels showing respectively the stars with large (r > 1.25 r©) and small (r < 1.25 r©) radii. 
The object now is to design color/RPM criteria that will select the overwhelming majority 
of small stars while rejecting as many large stars as possible. The broken lines in the two 
panels show our attempt to define such a selection function. 

What properties must a dataset have to allow one to apply this selelction function to 
it? The most important criterion is that stars with r ~ 1.25 (i.e., My ~ 4) must have 
proper motion errors that are substantially smaller than their typical proper motions. If 
their proper motions were measured to be consistent with zero, it would not be possible 
to distinguish them from evolved stars of the same color and apparent magnitude, which 
would then also have measured proper motions consistent with zero. Disk stars have typical 
transverse velocities v± ~ 40kms~^. Therefore stars with My — 4 have typical proper 
motions, 

//~21masyr-ixl0°-2(i2-^). (3) 

Second, the colors must be accurate enough to minimize scatter across the color boundary 
at {Bt - Vt) = 0.45. 

3. Application to the Tycho-2 Catalog 

An obvious choice for such a dataset is the Tycho-2 catalog (H0g et al. 2000). At 
V — 12, Tycho-2 has typical proper motion errors of ~ 3.5masyr~^, which according 
to equation (3) is quite adequate. Tycho-2 is also complete to F ~ 12. However, Tycho-2 
photometry, particularly Bt photometry is quite poor at these faint magnitudes. In our 
initial treatment, we will therefore restrict consideration to Tycho-2 stars with Vr < 11, 
for which cr^ < 2.5masyr~^. To avoid clutter, we display in Figure 3 a random 2% subset 
of these stars. Table 1 shows the total number of stars in the sample that are accepted 
and rejected, with percentages broken down into early-type {Bt — Vt < 0.45), mid-type 
(0.45 < Bt - Vt < 0.80), and late-type {Bt - Vt > 0.80) stars. If we ignore the stars 
that are rejected solely because they were too blue {Bt — Vt < 0.45) (on the grounds that 
such color selection was already being applied prior to our work), we find that 60% of the 
remaining stars are rejected by our selection criteria. 

Of course, a successful selection will not only reject many unwanted stars, but also keep 
the overwhelming majority of desirable stars. It will also minimize the number of unwanted 
stars that are accepted. We must use the stars in Figure 2 to evaluate performance in these 
two areas, since it is only this sample for which we have independent knowledge of whether 
an individual star is desirable. This will require some care because Figures 2 and 3 are 
selected in somewhat analogous, but not identical, ways. 
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The fraction of stars falsely rejected in Figure 2 is only 9% (50/ [536 +50]). This is 
likely to be a pretty accurate estimate of the false rejection rate for the Tycho-2 sample. 
Since all the stars in question have My > 4, the Hipparcos sample lies within ~ 50 pc. Since 
Hipparcos parallax errors at these bright magnitudes arc < 1 mas, the fractional errors arc 
< 5%. Hence the 5% parallax-error criterion does not substantially affect the sample of 
stars r < 1.25 r©. Thus, both the Hipparcos and Tycho-2 samples are effectively magnitude- 
hmited. Because the Tycho-2 sample has a fainter magnitude limit, it has a shghtly lower 
fraction of stars near the maximum allowed radius, since these stars are seen to a distance 
~ 250 pc from the Galactic plane where their density thins out somewhat. Hence, the 9% 
false rejection rate is probably a slight overestimate. 

The rate of unwanted acceptances is 65% (985/ [985-1-536]). This is likely to be somewhat 
underestimated because the unwanted stars are intrinsically brighter than the desired stars 
and so have suffered more from the bias induced by demanding good parallaxes. This bias is 
probably not extremely severe because to be accepted, the unwanted star must be reasonably 
close to the boundary. 

Note that, since 70% of stars are rejected and only < 35% of those accepted are useful, 
the fraction of stars in the overall magnitude-limited sample that are useful is only 10%. 

Table 1 also shows the results for a sample limited to Vt < 10, but otherwise selected 
according to the same criteria. This would be similar to the sample probed by the all-sky 1" 
telescope survey proposed by Pepper, Gould, & DePoy (2002). In addition. Table 1 shows a 
< 10 sample selected according to looser criteria as described in § 6. 

4. Application to Tycho-2 + 2MASS 

The results in § 3 were restricted to < 11. Since most transit surveys go beyond this 
limit, we would like to push the method to fainter stars. Recall from equation (3) that at 
V — 12, the Tycho-2 astrometry errors are adequate (3.5masyr~^) but the Bt photometry 
is unreliable. A straightforward way to resolve this problem is to match the Tycho-2 catalog 
to 2MASS and so replace {Bt - Vt) by {Vt -J). At these magnitudes, the 2MASS J errors 
are typically < 0.03 mag and so negligible compared to Tycho-2 errors. Also, although the 
Vt errors do deteriorate significantly for these faint stars, the effects of this are mitigated 
by the longer wavelength baseline: d{VT — J)/d{BT — Vt) ~ 1.73. Unfortunately, the full 
2MASS catalog is not yet available, but it is promised by the end of 2002. In the meantime, 
the Second Incremental 2MASS Release, which covers 47% of the sky, allows us to evaluate 
the efficiency of this approach. 
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Figure 4 is a Tycho-2/2MASS RPM diagram, with Vt,rpm determined from Tycho-2, 
and colors from Tycho-2 Vt and 2MASS J. A random 4.25%(= 2%/47%) of the data 
are shown so that the density of points can be compared directly to that of Figure 3. 
The broken line discriminates between transit target stars and non-target stars. The end 
points of these line segments are found by transforming the coordinates of the line seg- 
ments in Figures 2 and 3. To determine this transformation, we fit (Vr — J) of Hip- 
parcos/2MASS stars to a fifth-order polynomial of {Bt — Vt). This fit has coefficients 
(0.064677,1.915935,-0.034922,-0.676674,0.457714,-0.075894) with a residual scatter of 
0.113 mag. No magnitude limit is applied to Figure 4. (Note that the numbers of accepted 
and rejected stars have been scaled up by a factor 2.13 in Table 1 to take account of the fact 
that 2MASS presently covers only 47% of the sky.) 



5. Fainter Surveys 

With the advent of UCAC2 (Zacharias et al. 2000), it should be possible to push our 
method to about V — 12.5, at least in the north (S > — 2.°5). For these declinations, UCAC2 
expects to be able to fink up modern CCD astrometry with archival AGK2 plates {B < 13) 
to achieve Imasyr"^ proper motions (N. Zacharias 2002, private communiation) . For the 
most critical colors, {B — V) ~ 0.5, this corresponds to V = 12.5. Such high astrometric 
precision would easily satisfy the requirements of equation (3). Unfortunately, UCAC2 does 
not expect their CCD photometry to be calibrated, and there is no other source of all-sky 
optical CCD photometry at these magnitudes. However, a transit survey could calibrate its 
own relative photometry to achieve the necessary color selection. 

At fainter magnitudes, it is still possible to partially apply our method to early-type 
and late-type stars, if not to G-type stars. Of course, as has been mentioned several times, 
one can exclude early- type stars based on color data alone, i.e., without proper motions. 
RPM discrimination could then be apphed to late-type stars, with a color threshold that 
depended on the depth of the survey. First note that even for very faint stars {V < IS) it is 
possible to obtain proper motions with errors cr^u ~ 5.5masyr~^ for 5 > —33° by matching 
USNO-A (Monet 1996, 1998) to 2MASS (Salim & Gould 2003), and UCAC2 may be able 
to improve somewhat on this value for slightly brighter stars (i? < 16). Suppose a survey 
star has magnitude V and color V — I. If the star were a dwarf, then it would have absolute 
magnitude My = 3.37{V — /) + 2.89 (Rcid 1991). A tranverse velocity of v± ~ 40kms~^ 
would then be detectable at the 4 a level provided that. 
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Hence, for a survey with limiting magnitude V — lA such as the Kepler mission^ , it would still 
be possible to discriminate between giants and dwarfs ioiV — I > 0.95, roughly the blue edge 
of the clump. The great majority of stars in magnitude-limited samples at these red colors 
are giants. Hence, it is important to have an efficient mechanism for sifting through these 
to find the modest number of dwarfs that are viable targets of a transit search. (Presently, 
Kepler plans to make this selection by high-resolution spectroscopy of ~ 10^ stars.) 

6. Selection of Larger Stars 

Up to this point, we have implicitly assumed that the transit survey is systematics- 
limited, so that there is no point in probing stars larger than some radius limit, for which 
wc have adopted r = 1.25 r0. However, wc also wish to consider the situation in which a 
survey is photon-limited rather than systematics-limited. In this case, if the survey were 
senstitive to r = 1.25 Tq at its magnitude limit, then it would be sensitive to r = 1.57 one 
magnitude brighter. In the example of § 3, the search among these bigger stars would be 
restricted to Vy < 10. The dashed fine in Figure 1 is the r = 1.57 contour. Figure 5 shows 
the derivation of RPM selection criteria for this case. It is similar to Figure 2 except that 
the stars are divided at r = 1.57 rather than r = 1.25 r©. Figure 6 applies this criterion 
to the Vt < 10 sample. It is similar to Figure 3 except for the brighter magnitude limit and 
larger radius selection. The method is slightly less efficient for these bigger stars, rejecting 
only 57% of the stars remaining after a blue color cut has removed an initial 22%. Recall 
that the corresponding numbers for the smaller stars were 60% and 25%. 

7. Discussion 

The RPM selection function is very efficient at removing early-type and late-type stars. 
For the former, it reduces to a simple color cut that is similar to the ones already in common 
use. For the latter, its robustness derives from the large gap between the giant branch and 
the MS. The method is relatively inefficient at excluding evolved G-type stars because here 
the gap is much smaller. However, this residual G-star contamination is modest relative to 
the contaminants that are efficiently eliminated. 

Given that the primary new gain from this method is the elimination of red giant 
contaminants, one might ask how difficult it would be to weed these out by other means. 



http://www.kepler.arc.nasa.gov/ 
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Stellar transits of giants must be very finely tuned in order to mimic planetary transits. For 
example, the eclipse due to a star in a 0.1 AU (12 day) orbit about a 10 giant would last 
~ 1 day if the inclination were anywhere near 90°. Only if it traced a chord within 1% of 
the giant's limb would the eclipse be short enough to be confused with a planetary transit. 
If the data were of sufficient quality, the rounded shape of the lightcurve could reveal the 
stellar nature of the event. 

A more difficult problem is caused by triple systems containing a giant and a MS eclips- 
ing binary. If the binary separation is bigger than the radius of the giant, it is extremely 
difficult to distinguish this system from a planet transiting a dwarf star using the lightcurve 
alone. Indeed, even spectroscopic observations designed to recognize the ~ lOkms^^ reflex 
motion due to an M-star companion would most likely fail to detect such a contaminant 
because the giant would not be markedly changing its RV. Only a high signal-to-noise ra- 
tio spectrum would succeed in detecting the spectroscopic signature of the companion pair 
contributing ~ 1% of the light. These difficult contaminants are easily removed using our 
method. 

Perhaps the best aspect of our method in this respect is that it not only removes the 
need for foUowup observations of contaminant events, it removes the need to even analyze 
the lightcurves of the great majority of contaminating stars. 

In this paper we have ignored extinction. The effect of extinction (if it is not corrected) 
is to move stars along the reddening vector in the RPM diagram. This can essentially only 
move stars from being rejected to accepted and not the reverse. That is, ignoring extinction 
is conservative in that it adds to contamination but does not lead to missing legitimate 
targets. 

For our primary < 11 example, shown in Figure 3, extinction is actually a very 
minor effect: target stars My^ > 4 always lie < 250 pc, so that even in the Galactic plane 
the reddening is only E{Bt — Vr) < 0.06. Hence, ignoring extinction has hardly any effect. 

As the magnitude limit gets fainter, extinction becomes more important and so failure 
to correct for it can lead to significant increases in contamination. We caution, however, 
that accurately correcting for this effect is not trivial, and that such corrections should 
therefore be done conservatively, i.e., by maintaining a strong bias against overestimating 
the extinction. Even for fields for which the extinction at infinity is well measured (from e.g. 
Schlegel, Finkbeiner, & Davis 1998), a large fraction of the dust may lie behind the target 
stars, which tend to be relatively close. Thus, the 3-dimensional dust distribution must be 
modeled. Moreover, for fields close to the Galactic plane, i.e., those for which the extinction 
correction is most important, the Schegel et al. (1998) estimates for extinction at infinity are 
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less reliable. Hence, even more care is required in making the correction. 

This publication makes use of catalogs from the Astronomical Data Center at NASA 
Goddard Space Flight Center, VizieR and SIMBAD databases operated at CDS, Strasbourg, 
Prance, and data products from the Two Micron All Sky Survey, which is a joint project of 
the University of Massachusetts and the IPAC/Caltech, funded by the NASA and the NSF. 
This work was supported by grant AST 02-01266 from the NSF. 
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Table 1. Tycho-2 RPM Transit Selection 
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Fig. 1. — Hipparcos color-magnitude diagram (CMD) restricted to stars V < 7.3 with 
parallax errors smaller than 5% and photometry errors smaller than 0.1 mag. The solid and 
dashed lines arc contours of constant radius, respectively r = 1.25 and r = 1.57 Tq. Our 
primary objective is to design criteria that efficiently select stars below the solid line, but 
without making use of parallax information. 
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Fig. 2. — Hipparcos reduced proper motion (RPM) diagram. The upper and lower panels 
show respectively the stars above and below the solid (r = 1.25 line in Fig. 1. The 
broken lines (same in both panels) show our adopted selection criteria for distinguishing 
stars r < 1.25 r©. The false rejection rate is about 9%. The contamination rate appears 
to be about 65%, but is somewhat underestimated because the underlying sample is biased 
against intrinsically bright stars (My^ > 3.3) relative to a magnitude-limited sample. 
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Fig. 3. — Tycho-2 RPM diagram restricted to stars Vr < 11- Only 2% of stars are displayed 
to avoid clutter. The broken-line selection function (same as Fig. 2) rejects 70% of all stars 
including 25% early-type (rejected on color grounds alone), 5% "G-type" (0.45 < Bt — Vr < 
0.80), and 40% late-type. 



^° Tycho-2/2MASS Reduced Proper Motions 

12 3 4 

V^-J 

Fig. 4. — Tycho-2/2MASS RPM diagram. Similar to Fig. 3 except that by replacing Tycho- 
2 Bt with 2MASS J, we are able to extend coverage to the Tycho-2 limit, approximately 
V ^ 12. The brokcn-linc selection function is computed by transforming the broken- line in 
Fig. 3 using a {Bt — Vr) / (Vr — J) fifth order color-color relation. Because the scatter in this 
relation is only 0.11 mag, the selection is extremely similar to the Tycho-2-only selection. 
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Fig. 5. — Hipparcos RPM diagram. Similar to Fig. 2 except that the division between 
two panels is now done at r = 1.57 (dashed hne in Fig. 1). The broken hne shows 
our adopted criteria to select stars smaller than this radius. The false rejection rate is 
42/(1157 + 42) = 4%. The contamination rate is probably somewhat underestimated at 
882/(1157 + 882) =43%. 




Fig. 6. — Tycho-2 RPM diagram. Similar to Fig. 3 except that first, the sample is restricted 
to Vt < 10 and second, the broken line, which is taken from Fig. 5, is designed to select 
stars r < 1.57 r©. The efficiency of selection is qualitatively similar to the case for smaller 
stars (Fig. 3). 



